Logistic Regression Models for Analysis of Multistage Survey Data
نویسنده
چکیده
are extensively used in analyzing sample survey data to study the relationship between a binary response and a group of independent variables. Due to cost and efficiency considerations, stratified multistage samples are the norm. However, these samples, while efficient for estimation of the descriptive population quantities, pose challenges for model-based statistical inference. This sampling scheme often introduces multilevel correlation among the observations that can have implications for model parameter estimates. For multistage clustered samples, the dependence among observations often comes from several levels. Thus, drawing appropriate inferences from survey data may require complicated modeling techniques and very often, the computation required for this is very time consuming. This paper is focused on model-based analysis for binary data with a structure similar to that of the National Health Interview Survey. It proposes a logistic regression model which fits the between-cluster variation with random effects. The model also takes into account the correlation among small groups of observations within each cluster. An algorithm is proposed that makes the computation feasible for the mixed logistic regression model on large survey data. Generalized estimating equations (Liang and Zeger, 1986) are used in the estimation procedure to accommodate the correlation among the observations within small groups, avoiding problems associated with the use of random effects to model correlations among large numbers of small groups. An adjustment is applied to eliminate the bias in estimation of fixed effects that exists in some procedures for random effects logistic regression noted by Rodriguez and Goldman, 1995.
منابع مشابه
Multilevel Logistic Regression Analysis Applied to Binary Contraceptive Prevalence Data
In public health, demography and sociology, large-scale surveys often follow a hierarchical data structure as the surveys are based on multistage stratified cluster sampling. The appropriate approach to analyzing such survey data is therefore based on nested sources of variability which come from different levels of the hierarchy. When the variance of the residual errors is correlated between i...
متن کاملبه کارگیری مدلهای رگرسیون لجستیک ترتیبی در مطالعات کیفیت زندگی
Background & Objectives: Due to the increasing tendency to measure the quality of life in recent years and the extensive quality of life questionnaires, it is important to determine the appropriate method of analyzing data derived from these studies. The aim of the present study was to introduce ordinal logistic regression models as an appropriate method for analyzing the data of quality of li...
متن کاملPractice of Epidemiology Estimating Model-Adjusted Risks, Risk Differences, and Risk Ratios From Complex Survey Data
There is increasing interest in estimating and drawing inferences about risk or prevalence ratios and differences instead of odds ratios in the regression setting. Recent publications have shown how the GENMOD procedure in SAS (SAS Institute Inc., Cary, North Carolina) can be used to estimate these parameters in non-population-based studies. In this paper, the authors show how model-adjusted ri...
متن کاملPredictive factors for loneliness in female high school students; an unvariate and multivariate logistic regression analysis
Background and aims: Loneliness typically includes anxious feelings. It is particularly relevant to adolescence period. It has effect on physical and mental health. The present study aimed to identify the predictive factors of loneliness among high schools female students. Methods: A cross– sectional survey was carried out among high schools female students in Ilam during the academic year 201...
متن کاملFactors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis
Background: Due to the importance of medical studies, researchers of this field should be familiar with various types of statistical analyses to select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be as complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002